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ABSTRACT 

Many reference librarians still rely on menual 
searches to access vertical files, ready reference files, and other 
information stored in card files, drawers, and notebooks scattered 
around the reference department. Automated access to these materials 
via microcomputers using database management software may speed up 
the process. This study focuses on how two divisions of the Main 
Library of the Akron-Summit County Public Library — Science and 
Technology, and Philosophy, Religion, and Education — acquire, store, 
distribute, and use reference information, and examines four software 
products to determine which one would best meet these needs. Thirteen 
criteria were established for ranking the software, the most 
important of which were low-cost, user friendliness, 
IBM-compatibility, full-text capability, and searching ease. Such a 
system should be easy to use and low in cost. Four software products 
were selected and tested: askSam 5.0, Info Select 2.0, FoxPro 2.0, 
and QLtA 4.0. Test results favored askSam, although automation can be 
achieved with a number of different products and methods. It is 
concluded that automating reference files could be a worthwhile 
project. (Contains 105 references.) (KRN) 
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ABSTRACT 



Reference departments have been passed over to a certain 
degree during the automation of libraries. While the online 
public-access catalog is available to reference librarians, it is 
just as much a tool for the patrons. 

Reference librarians still rely on manual methods to access 
vertical files, ready reference files, etc. These information 
resources are stored in various card files, drawers, notebooks, 
etc., usually spread about the department. 

It may be more efficient to have automated access to these 
materials via microcomputers using database management software. 
This paper explores what has been done in this area and also some 
of the things that can be done. 

The file structure, contents, and usage patterns of the 
reference files of the Akron-Summit County Public Library, Akron, 
Ohio were studied. System criteria, including low cost, were 
established for automating the files. Four software products 
were selected and tested: askSam 5.0, Info Select 2.0, FoxPro 
2.0, and Q&A 4.0. 

Test results favor askSam, although automation can be 
achieved with a number of different products and methods. The 
conclusion is that automating reference files could be a worth- 
while project given sufficient funding and resources. 
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PREFACE 

The author became interested in this topic while serving a 
practicum at the Akron-Summit County Public Library. Having just 
finished a course in Library Automation, it was somewhat discon- 
certing to see just how much the reference librarians depended on 
manual files and systems. It seemed that with some investigation 
and application of existing technology, the level of patron 
service might be improved and the librarians' burdens lessened. 

All things considered, now that the study is done, available 
funding may be the deciding factor for implementation. The 
technology exists to satisfactorily automate reference files. 
With a little extra effort and money, some of the tasks of the 
reference librarians could be carried out more efficiently. 

The author wishes to thank the Akron-Summit County Public 
Library, Joyce McKnight, David Jennings, and Kaye Averette for 
their assistance and the use of the reference files. 
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I. INTRODUCTION 

Reference departments have been passed over to a certain 
degree during the automation of libraries. While the online 
public^access catalog (OPAC) is available to reference librari- 
ans, it is just as much an end user tool for the patrons 

Reference librarians still rely on manual methods to access 
vertical files, ready reference files, phone lists, local statis- 
tics, community information files, etc. These information 
resources are stored in various files, drawers, rolodexes, note- 
books, etc*, usually spread about the department. Some of these 
files are also indexed manually on cards. For the purposes of 
this paper, these information resources will be referred to as 
reference files. 

When a patron needs information from one or more of these 
sources (as determined by the librarian based on the reference 
interview), the librarian must either have good enough memory 
and/or enough experience to go directly to the correct reference 
file, or elr.e sort through a manual index (or indexes) and then 
go to the file to which the index points. If the information 
sought is not found in the first file> the librarian must proceed 
to the next file and repeat the manual search, etc. 

It could be much quicker and more efficient to have automat- 
ed access to these materials via microcomputers using database 
management software for these reference files. By re-organizing 
and/or linking computerized reference files and indexes, many or 

1 
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all of the files could be searched with a single search state- 
ment. Thus the librarian could save time and effort in informa- 
tion retrieval. If a terminal were available at each reference 
desk, it would also save the librarian the effort of moving to 
each file. (Although it would still be necessary to go vertical 
files to retrieve folders to which the vertical file index 
points ) . 

Answers to "repetitive-type questions posed by a fairly 

predictable patron population" could also be built into these 

2 

databases, freeing staff for more difficult research. Automa- 
tion could also permit printing reports for current awareness, 
bibliographies, and other informational needs. 

This paper will explore what has been done in this area and 
also some of the things that can be done. Various software 
packages will be tested for suitability using samples of data 
from the reference files of the Akron-Summit County Public Li- 
brary (ASCPL), Akron, Ohio. Other software packages will be 
examined for suitability based on published reviews and product 
1 iterature . 

The paper will focus on librarians as the intended users of 
the automated reference files. While it may be possible to 
automate reference files using an OPAC, the proprietary nature of 
the hardware and software places it outside the context of this 
paper. It is also outside the scope of this paper to consider 
the work in artificial intelligence for automated reference 
service for end users. 

2 
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II. BACKGROUND 



Literature searches turned up some information about using 
database management software in various ways in libraries. Some 
sources concentrated on how-to information for specific programs. 
A few articles described how to choose a suitable database 
program and offered comparisons of capabilities. There are also 
numerous reviews of database management systems (DBMSs). The 
attached bibliography cites many of these works. However, most 
DBMSs are not really suitable for automating text or reference 
files. ^» ^ 

So far this researcher has not come across literature de- 
scribing comprehensive solutions for automating reference files. 
There are bits and pieces of methods that could be implemented 
haphazardly, but it would require the user to work out many of 
the details. One article proposed using the search capabilities 
of a word processing programs for a rudimentary automation system 
for a vertical file.^ A few other articles describe local news- 

c y n 

paper indexing projects. ' ' 

Additionally, new versions of software programs come out so 
frequently that the literature is practically outdated before it 
is even printed. One company has recently released a product 

Q 

based on askSam 5.0 to automate libraries. One module of the 
program is advertised as a means to "provide access to the 
contents of a vertical file . . . and . . . integrate records of your 
organization's internal work product (e.g. research)."^" 

3 
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Experiments with using expert systems in reference work have 

been under way for several years ♦ However, the literature still 

debates whether experts systems have a place in libraries yet,^^* 
12 13 

* Although the cost of these expert systems in coming down, 
they still require programming experience for setup and opera- 
tion^ 

One article described how teams of graduate students experi- 
enced in expert-systems development allotted more than four weeks 
of a project just to learn how to use the software •^^ Many if 
not most reference librarians probably do not have that amount of 
time to devote to learning a software package before beginning a 
pro ject • 



III, REFERENCE FILES AND THE DBMS 



In designing a database, one must consider the setting in 
which it will be used, who the users will be, what types of data 
it should or could contain now and in the future, and what type 
of output is desired. One also should study how data is stored, 
distributed, acquired and used in the present system if one 
exists • 

This study was conducted using the files of two Reference 
Divisions at the Main Library of the Akron-Summit County Public 
Library: Science and Technology Division (ST) and the Philosophy, 
Religion, and Education and Information Division (PRE/ID). 
Reference divisions at ASCPL are physically separated and 
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maintain separate reference files* Subject materials contained 
in each division are determined by Dewey Decimal Classification. 
Reference questions and material requests by patrons are directed 
to the appropriate Divisions by subject. 

The reference files are used by experienced reference 
librarians primarily to answer patron queries requiring either 
ready-reference or non-book printed materials for answers. None 
of these files are open-access for the public . However , patrons 
may inspect the materials in the folders of the pamphlet (verti- 
cal) files and can examine the club list in PRE/ID after a 
librarian retrieves it for them. Some of the materials in the 
pamphlet files do circulate. 

Files selected for study from the Science and Technology 
Division (ST) were the pamphlet, recipe, information, and hazard- 
ous waste facilities files. Files sampled from the Philosophy, 
Religion, and Education and Information Division (PRE/ID) were: 
pamphlet, speakers bureau, club list, officials (government), and 
newspapers files. 

A. FILE STRUCTURES 

The files consist of two basic types: index and full-text. 
The index files serve as pointers to other files or materials. 
Examples of these from ST are the pamphlet, hazardous wastes, and 
recipe files • PRE/ID* s pamphlet file is of the same type* 



The pamphlet files actually consists of two parts: the 
actual file cabinets and folders that contain the materials being 
sought and the index to the files by subject heading that serve 
as pointers to the file folders. The index files are kept on 
three-by-five cards at the reference desks in their respective 
departments. Both the file folders and the index cards are 
arranged by subject heading. 

The index file also contains SEE and SEE ALSO references to 
direct the librarian to the proper or alternative headings. 
Headings are derived primarily from Reader *s Guide to Periodical 
Literature with additional specialized or local headings added as 
needed. Most headings are in natural word orr^^r rather than 
inverted. 

ST's hazardous waste facilities file has an index file that 
consists of three-by-five cards with the names of local sites 
that have been cited in the literature as containing hazardous 
wastes. These index cards point to file folders that contain 
clippings and documents related to these sites and clean-up 
efforts. 

The recipe file in ST consists of three-by-five cards that 
index the recipes that appear in the Wednesday food section of 
the Akron Beacon Journal . The individual recipes are indexed by 
food type, (i.e., desserts). Each card contains several records 
with the name of the recipe, publication date, and page number. 
These records point to file folders that contain the actual 
sections of the newspaper organized by month and year of 



publication. The first card in the index drawer lists all the 
types of foods that the individual recipes are indexed under, 
serving as a thesaurus. There are no cross references. 

The other types of files in use may be considered as forms 
of full-text files. Examples of these are the information file 
in ST and the speakers bureau, club list, officials (government), 
and newspapers files in PRE/ID. 

All of these files are stored on three-by-five cards except 
the club list. The club list has been automated and printed out 
using PFS: File and is kept in a three-ring notebook that patrons 
can use for reference. It is arranged alphabetically by the name 
of the various clubs. The club list contains information that 
can be used to contact the organizations including officers* 
names, addresses, and phone numbers. The record structure is 
similar to what one might use to automate a membership list. 
Each record also contains a single field for "subject" (the focus 
of the club). A subject index is included for manual searching. 
The club list is seldom used in an online mode. 

The other files contain information that is needed frequent- 
ly to answer ready-reference queries or else hard-to-find infor- 
mation that was deemed worthy of recording for future use. Many 
of the entries consist of only a few words on a single card while 
others have text that extends over several cards. Filing order 
varies depending on the content of the file. However, all cards 
have single headings that they are filed under. SEE and SEE ALSO 
references may also be found in these files. 



B. USAGE PATTERNS OF THE REFERENCE FILES 



Usage patterns were observed while serving a practicum at 
ASCPL and subsequently while employed as a librarian in ST 
Division. The author also informally interviewed the experienced 
librarians in both ST and PRE/ID on how, why, and when they used 
the reference files. 

The librarians relied on their experience and judgement on 
when it would be appropriate to search the reference files. The 
files were used both as primary and supplementary sources to 
answer reference questions. They also contain materials used to 
answer frequert or unusual questions that had been researched 
previously. 

To search the files, the librarian must get up from his or 
her chair, walk over to the file (or index), and manually search 
the cards. If the first file is a pointer, the librarian must 
then walk over to the file cabinets and retrieve the file folder 
with the proper subject heading and bring it back to the patron 
(or the phone if the patron is calling in). If the first search 
was not successful, the librarian must move to the next file or 
index and perform another manual search, etc. , until the material 
is found or it is concluded the material to answer the question 
is not contained in the reference files. 

The conclusion drawn was that the files might receive more 
use if access were improved and/or easier and faster. It is not 
efficient or convenient to go from file to file and drawer to 
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drawer under the pressure of waiting on a steady stream of 
patron* If subject headings could be searched at a keyboard, the 
process could be more productive and make better use of both the 
human and information resources available* 

Instead of having to move from file to file to file, the 
librarian could perform the searches at the keyboard* If the 
answer were in a full-text file, the librarian could possibly 
satisfy the patron's request without having to move from the 
desk. If the query required materials from the pamphlet file, 
the librarian could access the subject headings in the index 
while seated and then go directly to the pamphlet file to re- 
trieve the materials. This could save both time and energy as 
well as improve the level of reference service. 

Another benefit would be to make the reference files more 
easily accessible to new or substitute librarians in a Division. 
Currently the Main Library is open for four hours on Sundays 
during the school year. On these days, it is staffed with a 
skeleton crew drawn from branches as well as the Main Library. 
Sometimes the librarians work in Divisions they are not familiar 
with. In these cases, the learning curve can be rather steep 
when trying to master the various reference files. With the 
proper software, it would be possible to view or print a list of 
all the subject headings in the reference files for scanning or 
study. Printing a list like this is a task "that would require a 
prohibitive amount of time in manual systems. 



C. SAMPLE RECORDS 



In order to determine the characteristics of the files, 
sample records were typed into a Cambridge Z-88 portable micro- 
computer and downloaded to a desktop microcomputer for conversion 
to ASCII text. The file records were subsequently examined for 
field length, number of fields per record, extensive text within 
records, and unusual characteristics. 

The longest field in the samples is seventy-nine characters 
long. The most-repeated field in one record is the one for the 
data under the subject heading of BIRDS — COLLECTIVE with thirty 
entries. The former is in the ST pamphlet file while the latter 
is in the ST Information file. The largest record found fills 
four index cards, contains 337 words, and converts to an ASCII 
text-file 2,431 bytes long. 

The formats of other files vary widely within the files, 
notably the PRE club list and speakers files. Part of this 
variance appears to be due to incomplete information supplied by 
the organizations and to some fields being pressed into use as 
comment fields during data entry. The club file sample comes 
from a printout produced with PFS: File. 

The majority of records sampled are short and uncomplicated, 
however, one does not design the database to fit the short 

records. It is necessary to search for the largest, most compli- 

16 

cated record and design for it as well as the simple ones. 

An analogy might be that the database record must contain a hole 
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big enough (and with the right cutouts) for the biggest chunk of 
data to fit in. If the hole is too small and/or the wrong shape, 
the data will not fit. But if the hole is too large, there is a 
tremendous waste of resources. 



IV. DESIRED CHARACTERISTICS OF THE PROPOSED AUTOMATED SYSTEM 

In order to simplify searching and speed up the process of 
using the reference files, it would be desirable to search the 
subject and/or index headings (or other form of main entry) of 
any or all the reference files in a Division with one search 
statement. This would eliminate having to switch from file to 
file. It would also be desirable to search the text in full-text 
files to increase recall if needed. The software should addi- 
tionally support full-text retrieval and printing of search 
results and reports. To aid in determining if a search is too 
broad or narrow, the system could show the number of retrievals 
per search. Since usage of each Division's files is confined by 
subject and data flow to within each Division, there is no 
incentive or need to be able to combine the reference files from 
all the Divisions into one all-encompassing file. 

The system should also be easy to use. Although ASCPL has 
an OPAC and a few CD-ROM workstations, personal computer (PC) 
usage is not widespread among the staff. At the present time 
there is one personal computer per Division and Branch, with the 
most sophisticated software in use being PFS: File. Based on 
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observation and informal interviews, the majority of librarians 
do not use a PC in the course of their normal duties nor are they 
comfortable with DOS* Therefore it is important that the system 
be user-friendly and can be set up and operated without requiring 
advanced computer skills* 

Another consideration is the budget. If there is little 
funding available for equipment and software, this is a con- 
straining factor. If additional hardware is needed or existing 
hardware must be upgraded, this must be considered. Cost of the 
software, technical support, training, upgrades, and consultants 
(if needed) must be added in. One must also allow for the cost 
of setup and data entry. Since funding levels are currently 
frozen at ASCPL, cost is an important factor. 

There also must be enough space and electrical outlets to 
handle additional equipment. Otherwise it may be necessary to 
buy new fui*niture and/or remodel. However, this is a hardware 
factor that must be worked out on a site-by-site basis and is 
outside the scope of this paper. 

V. EVALUATING SOFTWARE 

Once it has been determined what the system should ideally 
provide, it is necessary to study software options for implemen- 
tation. The software chosen will depend on several factors. 
Broadly speaking, it must be able to satisfactorily contain and 

12 



manipulate the data in the files, be operable by the staff, and 
fit within budgetary guidelines. 

Given the structure of the reference files, a DBMS product 
would seem to be the logical choice. DBMSs in general handle 
discrete records and files, and some are capable of handling 
full-text entries. DBMSs also come in different types (flat- 
file, relational, etc.) as well as variations on similar types 
(dBase, XBase). By examining the data structure of the files 
further, it is possible to better evaluate the DBMSs. It is 
important to keep in mind that "one of the most important issues 

of database evaluation is the consideration of trade-offs. No 

17 

database can perform best on all features... . 

In designing records in a DBMS, one examines the data to be 
entered for the longest field, the record with the most fields, 
repeating fields, and any records that have unusual or unique 
characteristics. Most DBMSs are designed with business applica- 
tions in mind, i.e., customer lists, inventories, etc. In these 
databases, the record structure is fairly uniform. However, 
reference files pose special problems. 

In reference files, records commonly have fields that repeat 
(see, see also, etc.), fields that can vary greatly in length 

1 fl 

(subject headings, etc.), and lengthy text (information files). 
If one is working with a DBMS that has fixed-length and single- 
value fields, this can necessitate record designs that are 
wasteful of resources. One must design for the largest record in 
the file and subsequently each smaller record entered will take 



up just as much storage space in the computer. This is consid- 
ered a bad design, but it is better to be liberal with field 

19 

length than to lack space for an entry at an inopportune time. 

There are some ways around fixed-length fields by using memo 
fields and/or secondary files, but this usually requires extra 
steps in data entry and retrieval. It may also hamper the 

ability to format reports. The effort may be more than many 

20 

users will want to deal with. 

The other problem with inflexible DBMSs is that even if one 
does allow for the largest record currently in the file, a new 
record encountered in the future could be even larger necessitat- 
ing a redesign of the record structure. This is not difficult 
with some DBMS packages, but others may not be so forgiving, ' 

VI, PROCEDURE FOR EVALUATION 

A, SOFTWARE SELECTION 

Software was selected for testing based largely on how well 
it met the criteria established for the system overall (see table 
1), as well as the ability to handle the structure of the records 
and files sampled. Software was also selected with an eye to 
exploring the range of solutions possible with different types of 
DBMSs, Since the author funded his own project, including buying 
all the software, it was possible to test only a few selected 
software packages due to budget limitations. 

14 
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It had become apparent from reading the literature that most 
DBMSs would be relatively unsuitable for the task* Reference 
files demand flexible record structure from a DBMS, a quality not 
readily found in many programs. * Ideally, the DBMS would 

support variable-length and multiple value fields. Otherwise, 
space would be wasted in records . However , the costs of the 
DBMSs that explicitly support these features. Advanced Revelation 
and INMAGIC, exceed the budget limitations of this project and 
perhaps the budgets of many reference departments. 

Another consideration is ease of use. If the DBMS is too 
hard to use, it would probably sit idle. Both data entry and 
searching must be relatively straight-forward, simple, and quick. 
The ability to import data, i.e., text files, downloads from 
Dialog, etc., is also desirable. Ease in exporting data as well 
as formatting and printing reports are other important consider- 
ations . 

One more factor is cost. With the tight budgets at most 
libraries, this could be very important. Some DBMSs are quite 
expensive , while others are fairly reasonable . A f ew of f er run- 
time or search-only modules at lower cost for use at workstations 
where files need not be modified. Others offer developers 
versions that permit distribution of an unlimited number of 
applications at no extra cost. 

Currently, DBMS publishers are fighting for market share and 
some programs are available at greatly reduced prices. The 



industry term for these bargains is competitive upgrade, with 
purchase at the reduced price usually requiring proof of owner- 
ship of a comparable program from another software publisher. 
While it is possible to save money in this way, one can not 
depend on these deals being available at the appropriate time or 
with the right stipulations for eligibility* 

It is also important to consider the support policies of the 
software publisher. Some provide technical support for the cost 
of a phone call while others require an annual support subscrip- 
tion that may amount to several hundred dollars. Some of the 
more complicated products may require extensive use of technical 
support so paying the support fee may be essential for optimum 
use of the DBMS.^^ 

Based on the preceding requirements and information derived 
from product reviews and literature (see Bibliography), four 
products were selected as suitable for testing: askSam 5.0, 
FoxPro 2.0, Info Select 2.0, and Q&A 4.0. These software packag- 
es represent four different types of database programs, each with 
unique features. 



Table !• — Criteria for system software 



1 


must- have 


good-to- 
have 


desirable 1 




full-text capa- 
bility 


X 







variable record 
formats in same 
1 file 




X 




IBM-compatible 


X 






cost within bud- 
get 


X 







user-f riendly 
( overall ) 


X 






easy to search 


X 






show search 
hits 




X 




free-text entry 
in any one re- 
cord 






X 


record structure 
easily modified 




X 




run— time modules 






V 

A 


search-only 
modules 




X 




variable- length 
fields 




X 




multiple- value 
fields 
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B. SOFTWARE TESTING 



1 . INSTALLATION 



For the most part, the software self-installation programs 
perform smoothly. One simply accepts the default directory or 



specifies a different one and answers a few simple questions. 
FoxPro prompts for how extensive a tutorial is desired. (The 
full tutorial requires six megabytes of storage on a hard drive). 

2. FAMILIARIZATION AND DOCUMENTATION 

All of the DBMSs came with both online and text tutorials i 
some more extensive than others. FoxPro comes with the largest 
stack of user manuals, nearly eight inches thick. This is appro- 
priate since it is by far the most complex of the four products 
tested. However, the FoxPro manuals are written at a level 
intended for an experienced database programmer. 

Info Select comes with the slimmest manual, but it is 
written in language that a beginning to intermediate computer 
user could understand* The tutorial is brief but demonstrates a 
wide range of features. 

AskSam comes with four manuals that are written for the 
computer-literate user. The User^s Guide includes extensive 
examples as well as question and answer sections. 

Q&A comes with five manuals, but four of them are slim. The 
User Manual includes detailed explanations as well as numerous 
screen shots. The majority of the User Manual should be accessi- 
ble to the beginning computer user while a few sections would be 
easier for the experienced user to comprehend. The programming 
functions are printed in the Application Programming Tools 
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Manual. The entries provide brief explanations, but some exam- 
ples are given to aid comprehension. 

3. FILE RECORD STRUCTURES AND DATABASE RECORD DESIGN 

Basically the files can be characterized as two types: index 
files and information files. The index files serve as pointers, 
i.e., pamphlet file index, while the information files contain 
all or at least some of the data on the card/record itself, i.e., 
speakers file. 

Any of these records could be contained by any of the DBMSs 
selected. The question is how and how well. While a certain 
design may work, it may not be efficient or permit necessary data 
manipulation. Another factor to consider is ease of modification 
and adaptability. For example, some of the records examined have 
notes penciled in. Ideally, a DBMS should offer the same ease of 
adding a note to an individual record as writing it with a pencil 
(metaphorically speaking). If the record is designed with a 
rigid structure, such additions may be impossible or at least 
difficult. 

Within a division, it may work quite well to relate and/or 
combine files. Each record could be tagged with the original 
file name. If desired, a master database of all the subject 
headings, titles, authority lists, etc. could be distributed to 
all reference departments within a library for purposes of 
awareness and sharing of resources. 
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4* TESTING INDIVIDUAL DATABASE PRODUCTS 



It is difficult to compare these DBMSs even when using them 
to organize the same data because the programs operate in marked- 
ly different manners. As an analogy, squirrels are good at 
climbing trees and ducks are good at crossing ponds* But if the 
duck has to climb and the squirrel swim, trying to compare their 
progress would not prove much* Each one is designed for a 
particular niche. So it is with these DBMSs. Each one produces 
the results in a different manner. 

All four products tested handle the data, searching, and 
reporting differently. Each has its strengths and weakness and 
any of them could be pressed into service to do an adequate job 
with these files. Due to the disparate natures of the DBMSs, a 
definitive benchmark test could not be devised. However, the 
ease of working with a DBMS and its flexibility emerged as highly 
important factors. These factors are somewhat subjective since 
"ease" and "flexibility" are relative terms. Each product was 
rated on a one-to-five scale on thirteen factors, with "one" 
equalling excellent and "five" indicating no implementation, etc. 
Scores were totalled with low score being best (see table 2). 

Part of the goal or this research is to find products that a 
librarian with intermediate computer literacy could set up and 
use. If a product is so complex that it requires the services of 
a programmer and/or consultant, it will not be adaptable to 
changing reference file needs. Any of these products can be used 
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by someone familiar with DOS, although they will require a 
familiarization process with DBMS basics, especially FoxPro. 
Info Select can be used almost immediately with very little 
familiarization. 

a. ASKSAM 5.0 

AskSam is a unique product that allows structured records 
and free text in the same file. One record can have as much or 
as little structure as desired and the next record can be entire- 
ly different. (However, one must keep in mind the effects on the 
types of searches and reports one wants when designing records). 

AskSam stores information in its files three ways: record 
mode, document mode, and free mode. A record can contain up to 
twenty lines of data eighty characters wide. A document consists 
of linked records with visible boundaries between the records. 
Free mode is a document with invisible boundaries. Record mode 
works best for structured data that will fit in a 1600 character 
space. Document mode works well with longer records that still 
require boundaries but may contain some free form text. Free 
mode is best used with documents where boundaries are not impor- 
tant, such as letters, reports, or lengthy text. Free mode 
permits easy scrolling. 

AskSam has three kinds of fields: explicit, implied, and 
contextual. An explicit field requires a field name followed by 
a left bracket ([)• An implied field is a character or character 
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string followed by a text string, i.e., $495. Contextual fields 
are defined by the placement and order of text, i.e., city Kent. 
It is important to note that askSam does not require the use of 
any fields whatsoever. Use of them facilitates searching and 
reporting . 

Some of the greatest strengths of askSam are its ease of 
record design, data entry, and its ^\bility to handle free form 
text and additional fields at any point. AskSam uses templates 
that the user designs as the basis for fields for a record. Each 
file can have up to ten templates. Additionally, new or differ- 
ent fields can be added to any record as needed and fields may be 
repeated. Field names or other text strings can also be assigned 
to macro keys for quick and easy use. 

Another strength of askSam is its ability to do hypertext 
and global file searches. One can designate a group of files to 
be searched, for example, all of the files in a library division. 
In the hypertext mode, one simply uses the arrow keys to move the 
highlight over any word or words in a record and askSam will 
retrieve all other occurrences of that word in the file or files. 

AskSam also does basic Boolean searches, but searches the 
terms strictly left to right with no nesting possible. It will 
also do proximity searches. Complex searches require formulating 
search statements. Printing formatted reports requires familiar- 
ity with askSam commands, but examples are included in the 
documentation. 
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AskSam is a joy to use for setting up records and data 
entry. There is no need to agonize over record structure or 
count field lengths to preassign. If an extra field crops up on 
a record while entering data, one may just type it in directly, 
even repeated field names. While this practice may not meet the 
approval of DBMS purists, it seems appropriate for the reference 
files sampled. 

Searching is easy and quick, consisting of merely typing in 
the keywords or using the hypertext mode. Occasionally the 
hypertext mode requires use of the built-in editor to trim a line 
that has pulled in the field name. 

Overall, askSam is very flexible and easy to use. Data 
entry is simple and searching is quick and easy. The only 
drawback is the need to use command statements for some functions 
such as reports. 

b. FOXPRO 2.0 

FoxPro 2.0 is the latest version of a relational (by comput- 
er industry standards) DBMS, similar to dBase IV. It can estab- 
lish a relation between the files provided there is one or more 
fields in common to link the records. 

This relational capability makes it possible to record 
unique data in one file and repetitive or bulky data in another. 
This method has been suggested by some as a reasonable course of 



action. However, it necessitates extra steps and care in 
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programming and data entry to relate the records. 

For example, to store subject headings in one file and SEE 
and SEE ALSO records in other files, several general steps are 
required. Each set of records (subject, see, and see also) would 
need matching unique record numbers to establish relations . This 
would require extreme care during data entry to avoid lost 
linkages. It also requires establishment of relations in order 
to query the databases. This may be more programming than the 
average librarian wants to do. 

One of FoxPro's strengths is variable-length searchable memo 

fields. This permits storing lengthy entries without wasting 

space on fields with little or no data. However, these memo 

fields can not be indexed and it is not simple to search them and 
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display the results. 

The memo fields can be displayed in windows that change to 
match the record that is being browsed in the active database 
file browse window. But the windows must be sized and positioned 
carefully to permit viewing of the information. Without a mouse, 
this could be quite tedious. 

Searching indexed text fields requires command line state- 
ments and/or use of pull-down query forms. FoxPro will search 
for text strings with retrieval results that can be puzzling at 
times. The sections in the manual about searching seem to be 
written for the experienced DBMS programmer. 
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Literature accompanying the FoxPro software recommends 
several programming workshops and how-to books. These would 
probably be necessary to achieve any great level of success with 
programming this DBMS for complex tasks. 

While FoxPro no doubt works quite well in business and 
industrial settings in the hands of experienced programmers, it 
does not seem well suited for inexperienced users trying to 
automate reference files. It can be pressed into service for 
this specialized task, but would likely require more effort than 
many librarians would want to put into using it on a daily basis. 



c. INFO SELECT 2.0 



Info Select is advertised and reviewed as a Personal 
Information Manager (PIM), a type of software that helps one 
organize schedules, reminders, notes, etc. into a personalized 
database. It also has a "forms" feature which can be used to 
make formatted records. As with askSam, it is not necessary to 
use fields since Info Select readily accepts free text. But for 
ease and accuracy of retrieval it is better to use fields and 
structured records . 

Each record is displayed as a window that may be edited, 
printed, sorted, or discarded. Text or fields may be added at 
any place in the record. A window grows in size to accommodate 
the text it contains. 
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Records are stored in "stacks," the equivalent to files in 
other programs. One may load different stacks as needed, but 
only one stack can be in use at a time. However, stacks can be 
combined for searches or data manipulation. There is an size 
limit of ten megabytes per stack. 

Searching consists of typing "G" (for the command "Get") and 
then the text string or words desired. Info Select displays the 
number of records remaining that match the search terms as each 
letter is typed in. Results may be displayed at any point in the 
matching with precision varying accordingly. The search term is 
highlighted within the retrieved records. Text may also be 
searched within a given window using the F2 key. Boolean NOT and 
OR are supported in a convoluted manner. 

Info Select is probably weakest in the area of reports. 
Entire stacks, individual windows, or stacks of windows retrieved 
by searching can be printed out in the same format as viewed on 
screen. They can also be output to an ASCII text file for 
manipulation. But Info Select does not format output. 

One specific report that is available is an "overview" which 
gathers the top line of each window in a stack into one large 
window (provided one has enough RAM to handle large windows). 
This might prove useful for printing a listing of all subject 
headings in a given index file, a task that would be decidedly 
time consuming with a card index. 



In addition to handling reference files, Info Select would 
be very handy to record tough reference questions and their 
answers and/or reference desk statistics. Its flexibility would 
lend itself well to that purpose. 

Overall, Info Select is extremely flexible and easy to 
search , although limited to simple searches . Fields ( and/or free 
text) can be added at any time and their length is not limited. 
However, the program has limited reporting capabilities. 

d. Q&A 4.0 

In some ways Q&A is an unlikely candidate for this task. It 
is the only traditional flat-file DBMS selected for testing. Q&A 

is included primarily for two features: keyword fields that allow 

29 

multiple entries, and the Intelligent Assistant, which is an 
artificial intelligence searching feature. In addition, Q&A 
supports large fields (32,767 characters) and records (64,512 
characters ) . 

With Q&A, one must design a record, including designating 
field types, placement, and lengths. Fields are fixed length, so 
significant storage space may be taken up by empty space in small 
records. Part of this waste is made up for with the multiple- 
entry keyword fields. These are not multiple-value fields in the 
sense that the field is repeated as often as needed in each 
record. Rather, the keyword fields must be designed long enough 



to hold all the values, which is still wasteful. The only 
savings are achieved in simplified search strategies and the 
reduced number of fields and field names required. The keyword 
fields will not yield elegant designs, but they work. 

The records are easy to design by using arrow keys to place 
the field names which are simply typed in. Field length is set 
using visual placement of markers. One then selects the field 
types to complete the basic record design. Field templates and 
restrictions on data types can be specified if desired. 

The Intelligent Assistant is designed to work with typical 
database records, i.e. names and numbers, more than textual 
records. It does a marvelous job with the employee database 
provided as a tutorial. However, it just got in the road when 
trying to search the records from the ST pamphlet file. It was 
much simpler to use the search facilities in the File module of 
Q&A than the Intelligent Assistant. Nevertheless, the Intelli- 
gent Assistant does support Boolean and complex searches and may 
work better on more complex text records such as downloads from 
Dialog, etc. with numerous field names to operate on. However, 
it is overkill for searching the simple records of an index file. 
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Table 2, — Evaluation of software tested 





askSam 


FoxPro 
2.0 


Info 
Select 


Q&A 4.0 


full-text capa- 
bility 


1 


2 


1 


2 1 


multiple record 
formats in same 
file 


1 


5 


1 


5 


IBM-compatible 


1 


1 


1 


1 1 


cost within bud- 
get 


1 

$219' 


4 

$499' 


1 

$89* 


3 1 
$255' 


j user-friendly 
1 (overall) 


2 


4 


1 


2 1 


1 easy to search 


1 


3 


1 


2 1 


1 show search 
1 hits 


1 


1 


1 


5 1 


free-text entry 
in any one re- 
cord (non-memo) 


1 


5 


1 


5 1 


record structure 
1 easily modified 


1 


2 


1 


3 1 


j run-time modules 


2 


3 


5 


5 


1 search-only 
1 modules 


1 


5 


5 




variable- length 
f ields 


1 


1 


1 


4 


j multiple-value 
1 fields 


5 


5 


5 


5 1 


1 TOTAL SCORE 

1 (LOWEST = BEST) 


19 


41 


25 


47 



Note: Grading scale is 1-5: l=excellent, 2=good, 3=fair, 4=poor, 
5=does not implement . 



Prices are from MicroWarehouse mail order catalog Vol. 3.1 circa 
September 1991. These are the selling price, not suggested list 
price . 



C. OTHER DBMS PROGRAMS OF INTEREST BUT NOT TESTED 



Three other DBMS programs, Advanced Revelation, INMAGIC, and 
Paradox 3.5, were of interest due to unique features and/or 
favorable product reviews. However, various factors eliminated 
them from testing (see table 3). 

Two of the programs. Advanced Revelation and INMAGICi may be 
appropriate for the tasks of automating reference files. Both 
programs offer multiple-value and variable-length fields which 
are highly desirable features for text databases. However, they 
were not tested due to budget restrictions and the reported 
complexities of working with these programs. 

Reviews emphasize that Advanced Revelation is appropriate 

for use by program developers only, not end users. One reviewer 
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said it might take three months to program the first project. 
It is doubtful that many librarians are going to want to devote 
that kind of time to learning a DBMS. It also is more expensive 
than FoxPro, dBase IV, or Paradox and offers no educational 
discounts . 

INMAGIC has been reviewed in the library literature as being 

22 

appropriate for automating an entire library. It can be pur- 
chased with modules to do just that. However, it is also rated 
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as difficult to work with. Perhaps one of the major drawbacks 
may be the publisher's support policy. It costs $300 per year 
per designated caller for software support. The program itself 
is also expensive. On top of that, modules for features that 



perhaps should be included with INMAGIC are sold as extras for 
several hundred dollars more. 

While both Advanced Revelation and INMAGIC have valuable 
capabilities, they do not seem to offer ease of use for database 
design and implementation or use in answering quick reference 
questions. It may be possible to use these products to automate 
the entire library and design in the capabilities to automate the 
reference files. However, it may not be feasible to use them for 
a stand-alone project for the reference files. The cost and 
effort would probably be too great. 

Paradox 3.5 has received high marks in product reviews. 
However, it was eliminated from consideration for this type of 
application because does not implement memo fields. This would 
make it difficult to handle lengthy (more than 254 characters) 
text entries. While the task could be accomplished by using 
repeated fields,. it would require more complex programming. For 
example, to include the longest text field encountered in the 
files sampled would require ten fields, i.e., TEXTl , TEXT 2 , 
TEXTS, ... . This could turn into a programming and data-*entry 
quagmire. It is much simpler to use products with memo fields or 
similar capabilities when dealing with extensive text. 
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Table 3, — Evaluation of DBMS programs not selected for testing 
(based on product reviews and literature) 





Advanced 
Revelation 


INMAGIC 


PaTadox 1 

JL VA X. \Jb \JL \^ #V ■ 

3.5 


j full-text capability 


1 


1 


4 1 


1 variable record formats 
1 in same file 


5 


5 


5 


1 IBM-compatible 


1 


1 


1 1 


1 cost within budget 


4 

$749* 


4 

$995+* 


4 1 

$519* 1 


I user— friendly (overall) 


4 


4 


2 1 


1 easy to search 


b 


b 


2 j 


1 show search 
1 hits 


b 


b 


b 


j free-text entry in any 
1 one record (non-memo) 


b 


b 


5 


1 record structure easily 
{ modified 


3 


L 

D 


2 1 


j run-time modules 


5 


5 


5 [ 


j search-only 
1 modules 


5 


5 


5 1 


j variable-length fields 


1 


1 


1 1 


j multiple-value fields 


1 


1 


5 1 



Note: Grading scale is 1-5: l=excellent, 2=good, 3=fair, 4=poor, 
5=:does not implement. 

^Prices obtained from various sources circa September 
1991. These are approximate selling prices. 

'^Undetermined 

D. SOME SOFTWARE PROGRAMS ELIMINATED FROM CONSIDERATION 

A few programs were eliminated from consideration due to 
lack of features, poor reviews, and/or lack of funds for purchase 

32 




and testing. This does not mean they can not be used, just that 

using them could be more difficult or less satisfactory. 

DBase IV was not tested due to the lackluster reviews it has 
35 

received. While it does have variable-length and searchable 
memo fields, it is not a state-of-the-art DBMS. If one already 
owns and is familiar with dBase IV, it certainly could be pressed 
into service. However, there are no compelling reasons to go out 
and buy a new copy of it. 

Some may argue that Zylndex could be used for automating 
reference files. Zylndex is not a DBMS program. Rather it is a 
text-retrieval program. It is used to index existing text files 
on computer storage devices and then to search for text strings. 
The retrieval results can then be viewed and/or pasted into a 
word processing document, etc. The text files do not have to be 
structured . 

While this type of program may suit some individuals and 
uses, it does not facilitate printing formatted reports. (Unless 
one wants to spend time cutting and pasting to format search re- 
sults) . Also, since the text files being searched do not have to 
be structured, search results could be uneven. But if one 
already has large quantities of text files stored on computers 
and does not want to input them into a DBMS, Zylndex may be a 
viable alternative . 
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VII. CONCLUSIONS AND RECOMMENDATIONS 



A. SOFTWARE 



Which database software program to use depends on many 
factors: types of data, usage patterns, data flow, budget, level 
of computer literacy, the task at hand, etc. Even after choosing 
a DBMS programs, designing databases introduces a wide array of 
variables. There is no single correct DBMS program or database 
design. However, some are more appropriate and easier to use 
than others. 

Given the wide array of data and data structures in the 
reference files sampled (and the even wider array that may exist 
in other libraries), a DBMS should offer extreme flexibility. 
Also, given the varying levels of computer literacy probably 
existent among staff members, the program should be easy to 
understand and use. Low cost is also an important factor in the 
case of ASCPL. 

The following evaluations are given in order of total score. 
Of the programs tested, askSam offers the best implementation of 
these features along with the ability to produce formatted 
reports (see table 2 for scores). The program will handle text 
of any length as well as displaying graphics files. Ease of 
searching is excellent, especially when using the hypertext 
feature. AskSam can also do global searches on several files at 
once . 
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Database design with askSam is easy and forgiving. Fields 
can be variable length as well as repeated. Extra fields and/or 
free text can be added or subtracted from any record at any 
point. By using document or free modes, records of any length 
can be accommodated. Databases may be indexed if desired, but it 
is not required. Data entry is simple and direct. 

AskSam is also relatively inexpensive with a street price of 
about $220. Search-only modules are available for substantially 
less, as well as a developers edition that can be used to design 
and distribute applications. 

Probably the weakest point of askSam is formatting print- 
outs. But with some study of the command syntax, it is possible 
to produce the desired output and the commands can be saved as a 
macro for future use. It is questionable how often one ma^^^ need 
to print out extensive reports from the reference files. 

The next most flexible and easy to use program is Info 
Select. Data entry can be nearly as simple as typing if struc- 
tured records are not required. If structure is needed, record 
templates can be easily designed and called up for data entry if 
desired. Additional text or fields can be added to any record as 
with askSam. Searching is done with a one-keystroke command plus 
the search string. Info Select may be considered a little strong 
on recall and a little weak on precision. However, with a 
slightly more involved search process one can compensate for 
this . 



Info Select could be very useful at the reference desk for 
quick notes or personalized databases* Each librarian could 
easily have his or her own disk of information (book selectioni 
serials, bibliographies, etc*) to work on and use while on desk 
duty. Info Select can easily print out search results or an 
entire database* However, it is not easy to reformat records for 
printouts or reports. Info Select is inexpensive with a street 
price of about $100. It runs as either a memory-resident or 
stand-alone program. 

FoxPro has impressive capabilities. Unfortunately, it takes 
a programmer to bring them out. The variable-length, search- 
able memo fields make text retrieval easy in theory. Implementa- 
tion is another matter. Too many steps are required to search 
the memo fields easily, quickly, or accurately. 

It is also possible to use the relational capabilities of 
FoxPro to handle repeated fields. But it takes more programming 
and bother than most librarians may wish to engage in. FoxPro is 
also the most expensive product tested. 

Q&A 4.0 would be fine for simple office automation needs. 
However, it is out of its element with text databases. Fields 
are not variable length and the keyword fields are not true 
multiple-value fields. As a result, unused or partially filled 
fields would take up valuable storage space. Record design 
requires careful planning for optimization. Fields will appear 
in all records in a database and additional fields may not be 
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added to a single record at a time. Free text must fit within a 
specified field length. 

The Intelligent Assistant seems to be optimized for search- 
ing traditional databases, i.e., addresses, inventories, etc. No 
doubt it is quite useful in business settings, but it did not 
perform well searching extensive text. In any event, it requires 
entering a command-like search string. It is probably most 
useful for simple searches by business people who do not want to 
take the time to understand how a database works. Basically, all 
the Intelligent Assistant provides is some automatic substitution 
of command synonyms. 



B. OTHER FACTORS TO CONSIDER 



Ideally, each reference desk position would be equipped with 
a microcomputer for searching the reference files while seated. 
It may be possible to multi-task and run the OPAC in one window 
and the reference DBMS in another window. This would depend on 
proprietary compatibility aspects of the OPAC involved with 
software such as DESQview or DESQview/X running on the microcom- 
puter. If compatibility issues were worked out, multi-tasking 
could eliminate the need for placing both a dedicated OPAC 
terminal and a PC at each position. This would save on equipment 
costs and desk space. However, evaluation of this aspect is 
beyond the budget and scope of this paper. 
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staff training will also take some time. This may be 
balanced against the time it takes a newly-hired or substitute 
librarian to become familiar with the manual files. Automated 
subject retrieval may prove significantly faster overall than the 
manual system for familiarization and use of vertical files, 
etc., even when training time is included. 

While some personnel may balk at learning another automation 
system in addition to the OPAC, others may welcome it, especially 
new hires who would not be familiar with the local manual system. 
On the other hand, OPACs have been around for a few years and 
probably few librarians would choose to go back to card catalogs. 
Perhaps the best way to win over reluctant staff members would be 
with demonstrations . 



C. OVERALL RECOMMENDATIONS 



Automating the reference files could be a worthwhile project 
with sufficient funds and suitable software and equipment. It 
will take time and effort to design and implement automated 
reference file system. However, once it is operating, more time 
and effort will be saved in return. It could also result in 
better utilization of information sources and more efficient 
service. 
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APPENDIX 



SOFTWARE PRODUCTS AND PUBLISHERS 



Advanced Revelation 2.03 

Revelation Technologies Inc. 

2 Park Ave . 

New York, N.Y. 10016 

(800)262-4747 

askSam 5.0 
askSam Systems 
119 S. Washington 
Perry, FL 32347-1428 
(800 )800-1997 

DESQview 

Quarterdeck Office Systems 
150 Pico Blvd. 
Santa Monica, CA 90405 
(213 )392-9701 



FoxPro 2.0 

Fox Software, Inc. 

134 W. South Boundary 

Perrysburg, OH 43551 

(419)874-0162 

Info Select 2.0 
Micro Logic Corp. 
P.O. Box 174 
Hackensack, NJ 07602 
(201 )342-6518 

INMAGIC 7.2 

Inmagic Inc. 

2067 Massachusetts Ave. 

Cambridge, MA 02140 

(617)661-8124 

Q&A 4.0 
Symantec Corp. 
10201 Torre Ave. 
Cupertino, CA 95014 
(408)253-9600 
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